Reinforcement learning

Results: 1147



#Item
181Machine learning / Multi-armed bandit / Stochastic optimization / Reinforcement learning / Pi / Algorithm

From Ads to Interventions: Contextual Bandits in Mobile Health Ambuj Tewari and Susan A. Murphy Abstract The first paper on contextual bandits was written by Michael Woodroofe inbut the term “contextual bandi

Add to Reading List

Source URL: dept.stat.lsa.umich.edu

Language: English - Date: 2016-06-30 18:36:20
182Machine learning / Computational linguistics / User interface techniques / Multimodal interaction / User interfaces / Reinforcement learning / Apprenticeship learning / Computational learning theory / Speech recognition / Intelligent agent / Dialog system / Dialog manager

Inverse Reinforcement Learning for Interactive Systems∗ [Extended Abstract] Olivier Pietquin SUPELEC - UMIGeorgiaTech-CNRS) 2 rue Edouard BelinMetz - France

Add to Reading List

Source URL: www.ilhaire.eu

Language: English - Date: 2013-10-03 05:33:46
183Markov processes / Markov models / Mathematical optimization / Stochastic control / Dynamic programming / Markov decision process / Beamforming / Reinforcement learning / Optimal control / Markov chain / Q-learning / Control theory

1 On Stochastic Feedback Control for Multi-antenna Beamforming: Formulation and Low-Complexity Algorithms Sun Sun, Min Dong, and Ben Liang

Add to Reading List

Source URL: www.comm.utoronto.ca

Language: English - Date: 2014-05-05 14:44:36
184Operations research / Machine learning / Belief revision / Reinforcement learning / Dynamic programming / Markov decision process / Mathematical optimization / Supervised learning

Boosted Bellman Residual Minimization Handling Expert Demonstrations Bilal Piot1,2 , Matthieu Geist1,2 , Olivier Pietquin3 1 3

Add to Reading List

Source URL: www.metz.supelec.fr

Language: English - Date: 2014-07-15 03:12:51
185Machine learning / Artificial intelligence / Computational neuroscience / Robotics / Artificial neural networks / Reinforcement learning / Metalearning / Dalle Molle Institute for Artificial Intelligence Research / Q-learning / Robot learning / Reinforcement / Cognitive robotics

Neural Dynamics and Reinforcement Learning Presented By: Matthew Luciw DFT SUMMER SCHOOL, 2013

Add to Reading List

Source URL: roboticsschool.ini.rub.de

Language: English - Date: 2013-10-09 09:31:30
186Estimation theory / Reinforcement learning / Fisher information / Likelihood function

Policy Gradient Coagent Networks Philip S. Thomas Department of Computer Science University of Massachusetts Amherst Amherst, MA 01002

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2013-11-16 15:49:43
187Artificial neural networks / Computational neuroscience / Cybernetics / Belief revision / Reinforcement learning / Recurrent neural network / Q-learning / Long short-term memory / DQN / Markov decision process / Sepp Hochreiter / Artificial intelligence

Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan∗ CSAIL, MIT

Add to Reading List

Source URL: www.emnlp2015.org

Language: English - Date: 2015-09-04 01:25:56
188Estimation theory / Statistical theory / Statistical inference / Robust statistics / Least squares / Estimator / Bias of an estimator / Efficiency / Mean squared error / L-estimator / Consistent estimator / Statistics

Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning Philip S. Thomas Dhruva Tirumala Emma Brunskill Carnegie Mellon University

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2016-06-06 11:57:32
189Machine learning / Behaviorism / Belief revision / Reinforcement learning / AIXI / Reinforcement / Ergodicity / Learning

General Reinforcement Learning Jan Leike Future of Humanity Institute University of Oxford 9 June 2016

Add to Reading List

Source URL: intelligence.org

Language: English - Date: 2016-06-10 12:39:29
190Belief revision / Conference on Computer Vision and Pattern Recognition / Artificial neural network / Activity recognition / Object detection / Reinforcement learning / Image segmentation

End-to-end Learning of Action Detection from Frame Glimpses in Videos Serena Yeung1 , Olga Russakovsky1,2 , Greg Mori3 , Li Fei-Fei1 1 Stanford University, 2 Carnegie Mellon University, 3 Simon Fraser University

Add to Reading List

Source URL: vision.stanford.edu

Language: English - Date: 2016-04-13 14:34:27
UPDATE